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SXIMMARY 

I compare the transmiaaion/disequHibrium test (TDT) and affected sib pwr (ASP) test ujider a 
general aJgebwr, model describing a bi-alle1ic disease locUB- Assmning linkage to a bi-alleliu marker, 
I derive two bJtiOTnial probabilities, one for parental allele ' transiDissJon ' (P^) which detenttinea the 
magnitude of the TDT statistic (Xtat)> ^ second for identity-by-deacent (ibd) marker allele 
'Bharing ' (P^) which determines the ma^itude of the ASP test statistic ( j^^,). I also consider the ASP 
test applied to a completely polyTOorphic marker and demonstrate that the probability of ASP 
marker allele sharing (P,) is identical to observed for a bi-allelic marker in equilibrium with the 
disease locus. I present a general framework for determining the power of the TDT and ASP teat 
based on exprebsions for Pt, P^ and the proportion (N/F) of ascertained pe-renKs who are informative 
a.t the marker. Two previous analytic investigations of TDT power baaed on the work of Ott (1989), 
and Risch & Merikangas (1996) are shown to be special cases of this general framework. In addition, 
I show the relationship between the framework I present and a third analytic investigation of TDT 
power for multz-allelic markers baaed on the work of Sham & Curtis (1995). 

IKTBODT7CTIOI7 

Linkage haa been demonstrated between insulin-dependent diabetes mellitus (IDDM) and the 
insulin gene i^egion on chromosome llpl5.5 on the basis of linkage analysis by the transmission/ 
disequilibrium test or TDT (McGinnis ei al 1991 ; Spielroan pi al. 1993). Linkage was demonstrated 
at the insulin 5'VNTR, a hypervariabJe marker ths-t is extremely polymorphic, but whose VNTR 
alloloc fall into two tdoin aino oIooood in Cauooniann, thuc^ fornjing a natural hi n-lloHo ( 1 / ) mfi^rlTor. 
The + alleles were discovered to be positively associat-ed with IDDM in case-control studies (Bell 
«/ «?. 1984). Subsequent studies then demonstrated linkage in families collected for Genetic Analysis 
Workshop 5 (GAW5) by TDT analysis of GAW5 parents who were heteroKygons ( + / — ) under the 
STNTR bi-allelic categories (Spielman et aJ, 1993; see also Thomson et al 1989, Julier el al. 1991). 

The very strong evidence for linka^ provided by the TDT (x^ = 8.26, p < 0.005) was both 
surprising and pu^^Jing because identity*by -descent (ibd) sharing of 5'VNTR alleles in affected gib 
pairs (ASPs) yielded no evidence for linkage in the same GAW5 fannilieFi- Indeed, evidence for 
linkage was completely undetected or •hidden* because the proportion of alleles shared by ASPa 
did not exceed the null hypothesis value of 0.5 in two different types of ASP analysis. On one 
band, there was no increase in ASP allele sharing when the analysis included all GAW5 families in 
which both parents were Informative for any two lengths of 5'VNTR allele (Spielman et al. 1989; 
Ck>x Sl Spiolman, 1989). On the other hand, when the analysis included only those ASP parents 
wbo were evaluated by the TDT, narneJy those heteroKygous ( + /— ) when the 5'VNTR is con- 
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one affected sib is shared {n^) or * unshared' (ti^) by the second affected sib; thus TO, + 7iy = 7Za,p is 
the sample sdze for xli^p equals the number of triw* in the data set consisting of an informative 
parent and an ASP. 

tJnlike the ASP t.c9t which usually considers ASP allele sharing from parents InforTwatjvfj for 
any two marker alleles, the TDT only considers parents bet;er07*ygOUS for two partioular marker 
alleles (e.g. A/B only). For a aet of nuclear families, the TDT counts the number of times each 
A/B parent transmitted allele ^ or ^ to individual affected offspring. As shown by Spielman et al. 
(1993), the statiBtic for detecting linkage by the TDT is: 



where i\ and arc the number of inatancea in which fU[\ A/B paxent transmitted allele A or By 
respectively, to an individual affected offspring; and thus ttj^-HT^T) == ^tdt sample size for Xt^i- 

Note that the algebraic expressions for ;^;J^p and xtit identical in form. In eacsh x^> the de- 
nominator is tlie sample size of the data set. Thus, when sample aize (T^^p or Ti^t) is fixed, the 
denominator is constant and the magnitude of each x^ is determined only by the size of the squared 
difEerence in the numerator [{nn^n^)^ or («a^^u)^J' 

A key idea in this paper is that the magnitude of the numerator in each x* determined by a 
spenilic binomial probability. In the cose of xl:^,, this is the probability of ASP 'allele eharing' or 
/J,, i.e. the probability that a randomly ascertained parent of an ASP transmitted the same marker 
allele (ibd) to both affected sibs. In the absence of linkage, = 0.5. But when linka^ is present 

> 0.6, and the larger the value of (P,— O.G), the more ASPa that exhibit eJIele sharing (»„) and 
the higher the magnitude of ,yLp* Similarly, a second binomial probability denoted P^ {for prob- 
ability of 'allele trajtsmii^ft^*) determines the size oIxIm^ P^ is the probability that marker allelol 
was transmitted to a specific affected child, by a randomly ascertained A/B parent of on ASP. When 
linkage and disequilibrium are present, P^ =1= 0.5 and the larger the value of IP^-CSt, the greater 
the value of xl^x' 

Oeneral aigebrai-c rnodul of linkage 

At the beginning of Results, I give expressions forP^ and P^ based on the following general model 
A bi-allelic marker with alleles A and B is linked to a bi-allelic disease locus with diaease-predia- 
posJng allele D and non-predisposing allele d. The model aJlows any penetrance for the D/D, D/d 
and d/d genotypes (a. and y, respectively) such that IS^a^O, l^fi^Q and 1 ^ y > 0, and 
also a-saumes that no other locua underlies disease suBceptibility. The recombination fraction (0) 
between marker and disease locua is variable as are the population frequencies of the four marker* 
disease locus haplotypcs {f[AD) = c,,f(Ad) ^ c^,f(BD) ^ c,J{Bd) = c„ where r. + Ca + Ca+c, « 1]. 

Note that once the ha.plotype frequencies are apecifiied, the population frequency {p) of disease 
allele D is known {p =r c^-^c^l are the frequencies fm,l-m) of marker alleles A and B, respect- 
ively (7»,==c,-hC8;l-m =Ca + f:4). Furthermore, the coefficient of disequilibrium (d) equals 
c,c,-c.C5 and thus, when convenient, the haplotype frequencies can be expressed as = mp-^S, 
c, ^ = (l-.T»)jp-^, and C4 = (l-«»)(l-jj) + (^. 



3^ 



.01 



Based on derivations in Appendix I. oqwationa (1) and (2) show expresmons for P, and P, in terma 
of rtandfttd genetic variabJes for the general bi-aUelic model described above. Botb expressions 
iMSUme that parents are aacertsined through a randomly elected ASP. and each expreasion applies 
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Lto an ascertainerJ parent who is also beterozygoua AjU at a bi-allelic marker. P, ia the probability I 
that the parent trswisnaitted the same marlcer allele to both affected aibs. ia the probability that j 
the parent transmitted allele to a pnrtjciilar affected child.. \ 



Equation i 

Equation 2 

Note that the expressiong for and are aiitiilar ijn form. When there is no linkage, both 
expresFiions equal 0.5; but when linkage id present an amount is added to 0.5 which, in each ex- 
preaaion, is a product of three factors. In both cxpreatriona, the leflmost factor depends on the 
recombination fraction the middle factor on baplotype frequencies (c^,Ci,C3,C4) and the quan- 
tity (see Appendix I), and the rightmost factor on penetrances (a,;tf,y) and the frequency {p) of 
disease allele D. BccauHe they play analogous rolea in each expreagion, I denote the leftmost factor 
in Pg and Pt asi.Crg and L^, respectively ; nnd aimilarly denote the rightmoat factor as and R^, and 
middle factor as and i/^. Thua. P, ^ (^,b^L^U^B^ while P^ = CS+L^Jf^JJi. 

tPt-^'-^l > (^8-^'«^>) '^^^^'^ disegtviUbrium is eaareme 
As described in the Introduction, the ASP approach failed to detect linkage at the inauljn 
^TOTR because the proportion of marker allele sharing in ASPs was close to 0,6, i.e. P^ 0.5 
whether the 5'VNTB. was treated aa bi-aJlcUc or as highly polyraorphio. By contract, the TDT was 
able to detect linkage in the same families because ^ 0.60 (see Spielmau el al 1993). Thus, re- 
gardleas of dUTercnces in relative sample size (».^p,7ji«iJ for xl^ and xlii> the relative magnitudes of 
(P3-O.5) and [Pt-0.5| are often the critical factor that causes a substantial differenco in power for 
Xhv and xi^x- 

It is therefore interesting that analysis of equations (1) and (2) (see below) shows that when 
disequilibrium reaches its mo^t positive value {d^^) or its moat negative value (^^^jj, the 
magnitudes of P^ and P, axe such that: (a) |P,-0.6| and (P,-0.5) rtb both maximized and (b) 
\P,-OS\ > (P«-O.0). This dependence on S of |P,-0.5| and (P,-0.5) also ha« other important impli- 
cations aince the value of P« for a c&mplPMly polymorphic marker is identical to the P, value of a 
bi-allehc marker in equilibrium (d = 0) with a bi-allellc disease locus (see Appendix IV). Therefore, 
if ASP allele sharing for a completely polymorphic marker ia denoted by (P^-O.C)^o, then and 
P, for any bi-ailelic marker in extreme disequilibrium with the bi-allelic disease locus are such that: 
|Pf-0.5! > (P,-0.5) > (P«-0.6Uft. 

To imderstand the pivotal role of ^ in maximi^ng tP,-0.5| and (P,-0..5), and in determining 
their relative magnitudes, consider the three corresponding factoid in P, and P,. By inspection, L, 
-{I W)>Ly^[i^2e)\ and when disequilibrium is present, $ should be near 0 and hence 
i^*^" 7J^' 'J' maximum value of each factor. Furthermore, s.. ^bown below. is substantially 
greater than R,, and the difference between the two fa^tor^ is independent d and 0 since OJid B 
r f r '^^"^f^^^^^^^^ ""^'^^ ^'^^^^ 'c^cuB (7., a. Ay). Therefore, L,>L, and R,>R^, so i^ 
f^lows that |P,^0.5 would always exceed (P.-O.^) were it not for the inEuen^e of tb; remaining 
two factors in equatioMs (1) and (2) (M^ and M^). 

r^^^L^rli ^"'^ l^! ™ denominator aa defined in Appendix I); but the 

numerator of Jtf, .s * - c^e.-cc. whiJe the numerator of Jf. ts the two component, of $ added 
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Comparison of the TDT and ASP teat 163 

togeXhfif (c^c^ + CaCn) implying that > \M^l Since W-i\ reaches its roiiiimum value of 0 at equi- 
librium = 0). while is always posttive, it follows that (P,-0.5) > |Pt-0.6| « 0 in an interval 
of d values around = 0. However^ in Appendix III, I assume that marker allele frequency (w) 
and. djgcaae allele frequency (y) are fixed, and then show that W\\ =M^sitS = d^^^ and at ^ = d^^. 
1 also show that |ilf J and are both maximized at one of the two extreme ^ values (^m»x <^ '^min)- 
Therefore, for any bi-alJelic marker (i.e. any m and p), when ^ equals tf^nx or lPt-0.5| and 
(Pg— 0.6) are maxlmiised since IMJ and M^^ are maxim i7-ed ; furthermore, because and 
Z'l ^g, the greater nnagnitude of compared to drives |P^— 0.5| higher than (Ps— 0.5). 

Why > 

To understand why > P^,, note first that both factors contain three component's that are mul- 
tiplied by the coeflicienta [2p(l -ji)]/16 and {l-p)^/4:, respectively. In P^, each component 
has the form (Z7— 7)" while the corresponding component in P^ is (IP'^V^) where U and V (for the 
components multiplied by yV*» [2p(l— J>)]/16 and arc, respectively, V= a,a+fl,fi and 

V ^ JS^fi'hy,'/, Under the assumption that D/D penetrance exceeds d/d penetrance (a> y) and 
that D/d penetrance (fi) lies Romewbe.t*e between (a>^>7), each component in P^ [IP — V^ = 
{U-^VilU-V)] must exceed its counterpart in P^ t(^-^)*l ainc* {U-hV) > (TJ-V). The only 
exceptions occur when mode of inheritance is dominant (a = fi) or recessive (/ = y) in which case 
one pair of analogous components in P^ andPj^ equal; however, the other two componenta in P^ 
still exceed their counterparts in Pg^, and thus R^ > Pg. 

' To assess how the elevation of Pj above JR^ is influenced by the degree of risk conferred by the | 
disease locus, the risk can be quantified by considering the penetrance of the D/D fcomozygote f p ^ 
(a) to be r timea greater than the penetrance of the d/d homozygote (y). Thus, a = ry and the I L ' J 

penetrance of D/d {fi) can be considered to fall between a and y by letting y? = r+3:(«— y) = I 
7 + a?(r— l)y where » is a number between 0 and 1. Based on this parameterization, the ctiyt 
penetrance ratio (r) can be evaluated for its influence on P^ and Pg by dividing each component in 
Pi by its conterpajTt in P^ which yields the ratios : 

Note that r appears in each ratio only in the term 2/(r— 1) implying that Pi/P, increases mono- 
ttonically as r decreases. Thus, the elevation of P^ above R^ is most extreme for susceptibility loci T| r ^ 
Jgauaing a modest incrc^e in disease risk as indicated by low values of r. J ^' ^ 

In Tables 1 and 2 below, I show values of P, and P^ when r i= 2 and r ==: 4, respectively. In these 

tables, (Pa-0.5) 0 indicating that linkage would be difficult to detect by the ASP approach; but 

|Pi-0.6| is much greater than (P^-CS) when disequihbrium is extreme, thus illustrating that at 

low r values, P^ drives IZ\«-0.5| to levels that provide strong evidence for linkage. 

Power of xIbj^ tind xfc^x 

I now show how to calculate and compare the power of Xm O'^d Xi^t ^hen (a) both tests are 
applied to a bi-allelic marker or (b) the TDT is applied to a bi-allelic marker but the ASP teat 
considers a completely polymorphic marker. I assume the two te^it^ evaluate a series of S randomly 
ascertained parents of one or more ASPs, and that each test considers one ASP per pamnt. In the 
FDiscussion, I explain how to calculate the proportion {H/F) of the S parents who arc informative *1 
I At a bi-allelic marker. (The quantity F is proportional to the population frequency of parents who I r X a/ " 
■ have two or more affected children, and // is proportional to the frequency of such parents who I '^^^ 
jarc also heterozygous at the marker.) Thus H/F determines the sample si^e for xlp and xlai (see J 
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164 R. E. McGmins 

Proportion of AfB pnrenta in aKcertained famaies). For ir«t4i.Tir.c, if both teats evaJuatc the game 
bi -allelic marker^ then sample ais&e 

for AtSbp is ^a^p = W^)^*7> while Bample size for AJuJt "^^ ^^ice aa*7 r^j^f-Ot} 
r large (n^ai = 2(/f/i?')5) sijoce }^^^ counts pairs of transmitted alleleB while Aftat counts individuals ^ 
alleleSr Based on thc^so sample? sdzea (n,«,p,^tdt) ^d the valuea of and the power of and 
are determincsd from the binomial diatributioiiB 



[ 



[ 



and -2£^P^(l-P,r^, 



respectively, e^s explained below. 

Similarly, the power of xt»^ when applied to a completely polymorphic roarkor can also be de- 
termined from the appropriate binomial distribution, but n,ui|) = B since all parents are informative, 
and « 0.5 + (l-2^)H??(l-.?>)/-F]/?3 as shown ui Appendix IV. Interestingly, this expression for 

when the marker is completely polymorphic is identical to for a bi-allelic marker in equi- 
librium with a hi*alle)ic disease locus. This can be veriiied by setting ^ = 0 in = 7rt|;+^, — 
7?i(l— 30) — = (1— m)jp — <y, =» (1— m)(l— 57)+* and substituting for the four haplotype 
frequencies in the e^cpression for H (see Appendix 1) and in equation (I). 

Based on sample size {n^^in^Ci binomial probability (PsjA)* ^^o binomial distributions are 
generated which can be used to calculate the power of ;tjpp and y^^^. ^ described in Appendix II. 
Specifically, the power of or the probability that > (a aigniiicaiice cutpoint) is equal to I 
the portion of the binomial distrihution 

-22^J',M1-P,)'»» for which n,>!^+ ^^''wp-^> . 



I Similarly » if marker allele A is asaociated with disease, the power of "j&^x estimated by the portion | _ 
1 of the bluomial distrihution 

L Jiisi'pn,,i_p^)», for Which «.>2^+^;%i^. £?>3^»o'^ 



Thns, standard tables giving the nomta] approximation to the binomial distribution (Pearson ^| r f/XiK, 0*? 
Hartley, 1954; Weir, 1996) providn precise power values for virtually any sample Hize («„p,»iat)jj ^ 
binomial probability (P„PJ, and trignlfioanoe levol. ^ 

Comparison of TDT and ASP povtcr 

Here I illustrate how the equations for /J, P^ and H/F can he used to compare the power oi xIa\^ 
xW^ assume the two teste oonaidcr markers that are tightly linked « 0) to bi-aJlelic disease ff 
loci with additive mode of inheritance = {a-\-y)/2) fl.nd for which the a:y penetrance ratio is 1 f oJ^^^-Oi* 
r « 2, r = 4 or r = 10. Penetrance ratios of r = 2, 4 and 10 were chosen as being somewhat rep- I 
reaentativo of the entire genetic parameter space since I have found that and increaee rapidly 1 
as r increases from 2 to 6 with smaller, asymptotic increases in and P^ for r > 10. Furthermore, I 
additjve mode of inheritance may also be regarded as being somewhat representative since results I 
from other modes of inheritance do not, in general, substantially differ from results presented here. I 
Id the ta.ble8 below, I compare xla and xL^ when both teste consider the same bi-allelic marker, or 
"w^^e" A'mp considers a fully Informative marker and xlA^. evaluates a nearby bi-allelic marker. Such 
wngle test comparisons would be occasioned by: (a) TDT and AS? analysis of a marker ths.t gave 
^suggestive' evidence of linkage a^d diaea«fi-a«aociatjon in other families or in compansons of allele 
frequencies in ca^gs and ujirelated controls; or (b) TDT and ASP analysis of markers near a can^ 
didate gene suspected of increasing disease susceptibility. 
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In Table 1 (r = 2). Table 2 (r = 4) aiid TftbJe 3 {r - 10), column 1 shows dkeas© allele frequency 
(p) for a bi-allelic diseaw. locus, and columns 2, Z and 4 show reaulta for Xwp applied to a fully 
informative marker. The rcimaininj^ columns in e<M?b table list Xim xlan results for » linked bi- 
aJlelic marker whose ftUelc frequency (m) is listed in column 5. Reaulte are given for each value of 
m (0.76, 0,50, 0.25) aaauming positive difiequilibrmm between the bi-allelic marker allele and disease 
allele is maxinaal (-5 = d^^) or half-maxinoal (S = ^^^) where 9^^^ ^ rain [(l-m)??, (I -p)m]. TDT 
power (two-tailed teat) and ASP power (one-tailed te»t) arc for a aignifioance level of OM and are 
based on a sample size of 200 familiea (i.e. 400 parent-A8P trio8) and thus n^p = 4001H/F) while 

The ASP test can detect linkage over long diatanues {6 > 0) and in the absence of diuequDibrixun 
(<J=0); but the TBT has no power when ^ = 0 and henc^ oan detect linkage only over short 
distanceB (f^enerally less than 1 cM). Yet when disequilibrium is haJf-roaximal or greater {d > iS^), 
Tables 1, 2 and 3 each show that TDT power almost always exceeds ASP power whether xLp ia 
applied to a fully informative Or bi-allelic marker. When f = 2 (Table 1), linkage is virtually 
undetectable by xLp wnce (^»-^'^) < 0.13 and ASP power is 0.10 or lower; by contrast, the TI>T 
is able to detect linkage but TDT power exceeds 0.50 only when $ ia close to and allele 

frequencies {m.p) are similar in magnitude at the marker and disease locus. For r = 4 (Table 2), 
ASP power is increflsed but still relatively low (^0.33) for fully informative markers and for most 
bi-allelic markers. TDT power is also substantially higlier and, for most marker*i, exceeds 0.05 when 
* ^ - ^m.Lx a«cl exceeds! 0.60 when d = id^^Ti^ thus indicating that when r = 4, the TDT could dem- 
onstrate linkage to many disease loci whose linkage might be diificttlt or impossible to establish by 
the ASP test. 

Por r^iO (Table 3)> TDT power is reasonably high (^0,66) when S > and ASP power Is 
also elevated (> 0.50) except at the highest. disesUBe allele frequency shown {p = 0.6) where ASP 
power ia 0.2S for a fully informative marker. Thus as r increases from 2 to 10, the tables show that 
and ASP power increase substantially and hence, when r = 10, the relative power advantage of 
the TDT is diminished. Nevertheless, as indicated by lower ASP power when p = 0.6 at r ^ 1.0 
(Table 3), ASP power at elevated disease allele frequencies (p > 0.6) remains low (< 0.50) even 
when r-*^ 00 (data not shown: table available from the author). For example, if the same power 
analysis shown in Tables 1-3 were conducted for r = co (i.e. y *- 0) then for a fully informative 
marker and disease allele frequency of ;j = 0.76, and ASP power would be 0.519 and 0.19, re- 
spectively. By contrast, TDT power would be much higher (^0.80) bub only when ^ > ^^m^x ^"^ 
m is close to = 0.7.5 (i-o. 0.65 0.85). 



In concluding this section, I emphasdze that Tables 1-3 show that when the disease locus and 
marker are bi-allelic, TDT power is Hubstantially increased if the disease allele and positively 
associated marker allele have similar frequencies, Muller-Myshok & AbeJ (1997) independently 
made a similar observation, but they emphasized the weakness of TDT power when the m/p ratio 
departs from unity and $ is not close to S^^^. However, the tables illustrate that similar frequencies 
for the disease aJlele and associated marker allele can increase TDT power to reasonably high levels 
even when the m/p ratio substantially differs from 1 and 3 is much lower than dj^^. For example, 
in Table 3 (r = 4), note that when d = ^S^,^^ and p = 0.15, a similar frequency {m = 0.25) for the 
diseaee-associated marker allele produces TDT power of 0.86 and of 0.581; but when p = 0.15 i 
and m = 0.5 at <y = i<5^,^, TDT power and P^ fail to 0.63 and 0.547, respectively. The difference in I 
TDT power for these two situations can also be quantified by calculating the mean value of xUi 
based on a sample of 200 ASP familiog and the values of P^ and H/F in Table 4 [i.e. xiax = 
SO0{H/F) (aPt- i)% When p ^ 0.16 and wt = 0.5, xldt = 3*^3 yielding a significance level of p 
0.06; but when p = 0.15 and w = 0.25, Xuli » 02 for a significance level of p < 0,003. The large 
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168 R. E. McGinnis 

difference in significance level (0.06 versus 0.003) and power (0.63 versus 0,86) illustrated by this 
example indicEitea that careful attention to allele frequencies at bi-AJleiic markers may play an 
ijnporta^nt role in future cflTorts to map susceptibility loci. 

DJSOTTSSIOF 

The equations for i^, and IJIF enable comparison of TPT and ASP power for the same family 
data, since the three expressions as-sujue random ascertainment of parentis of i-mo or more affected 
"children. However, the TDT can be applied to families with a single affected child, so in Appendix 
1, T derive an expression analogous to (denoted FX) whicli gives the probability that allele A was 
transmitted, to aji affected child by a randomly ascertained AfB parent of owe or more affected 
offspring. The derivation of Pf is almost identical to that of Z^, and hence the algebraic form of | ^oC, \^ 
is aimilftr to that of and ' ^ 

Equation 3 

where g» = 2(c,c^-hCaC3) ^^~^'*"^"'"'^ +2ctea{pa-y^+/y} + 2c,c,{p^-yy + y} 

Previotts aneUya^ of TDT potjoer are special casee of iht <mrrmt analysis 

I now show that two previous analytic investigations of TDT power (Terwilliger & Ott, 1992; 
Risch & Merikangas, 1996) are special cases of the curitjnt analysis. These two analytic 
investigations and a third by Sham & Curtis (1995) as well as simulation and computer-based 
analyses of TDT power (Schaid & Sommer, 1994; derget-Darpoux et al, 1995; Kaplan et al. 1997) 
all assumed the disease locus to be bi-allelic. Sham & Curtis (1996) and Kaplan et aL (1997) con- 
sidered a multi-allele marker locua, but the other analyses assumed either a bi-allelic marker or a 
direct test of the disease polymorphism itself, and each analysis examined TDT power for one or 
several specific modes of inheritance. 

Terwilliger & Ott (1992) considered a TRocsaive disease with no phenocopies in families 
ascertained through a single affected child. By investigating the same reeosalve model, Ott (1989) 
had previously derived an algebraic probability for transmission of each marker allele (denoted H 
and h) to afTected offspring by heterozygous H/h parents and by both types of homozygous parent 
(see Ott's table II), Thus, power results for the TDT (McNemar's teat in frgure of Terwilliger & 
Ott) can be derived by using Otfa table II to compute for the recessive, zero-phenocopy model. 

is derived by considering only the two probabilities in Ott's table for heterozygous H/h parents, 
and by dividing the transmission probability for allele IJ by the sum of the probabilities for allele 
n and allele h to yield: 

^ {m^S/p){i^m)-^m[{l'^m)-d/py 
where, substituting my notation for Ott's, wis the frequency of marker alleJe 11 (or, alternatively, 
my allele A), p is the frequency of disease allele D. and. 8 is the coefficient of rUsequilibrium. This 
expression for Pf is seen to be a special case of equation (3) by making the appropriate penetrance 
substitutions (a > 0,^ ^ y = 0) into my expressions for Pf and and by expressing c„ c,, c, and 
c, in terns of p and d according to sta^ndard exprea^ions «iven above (see General algebraic 
model of linkage). 

Biaeh & MerikBoigfts (1996) compared TDT Mid ASP power for an intermediate mode of irherit- 
.ince .n which D/d ponetnuioe (fi) Ia a multiple (A) of d/d penetrance and r « kK For their MaJy^ls. 
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Com.paHson of the TDT and ASP iesi 169 

the TDT was aBsumed to teat the rligonsc locus itself or a perfectly associated bi-ftHcUc marker 
ajid, under this assumption, Risch & Merikongas found that jP* = /J = With appropriate' 

substitutionB « 0,a = k^.fi ^ /cy,Ci = ^,€4 = 1— p, = c„ = 0), equations (2) and (3) for and" 
Pf nho girriplify to + A), thus agi-eeing that for this particular model, Pi and P* are (a) rdt^ntioal 
and (b) independent of diaeaae ajlele frequency. For many other genetic modela, P^ and do 
^ appear to be similar in vahjc (tbo\igh not identical). However, graphical analysis of P^ showA that« 
the value of is independent of disease allele frequency only for the particular mode of inheritance 
considered by Risch & Merikangas (graphs showing this are available from the author). 

The expression in Risch & Merikangas (1006) for Pf^ (their Y) when a marker is fully informative 
can also be shown to be a apeciaj cose of equation (1) for P^. This is verified by substituting the 
appropriate mode of inheritance parameten^ {a = k^y^fi^ ky) into equation (1) and also by sub- , 
atitiifcmg parameters for a closely United marker jjo equilibrium with the disease locus (i.e. $ = Q 
and S - 0 in ^ mp'h 8, =^ m(l—p) — d, Cg « (^i — m)p—(J and = (1 — m) (1— ^>) + <J). In Appen- 
dix rv and Results (see Power of and xiddi ^ showed that when a bi-allellc marker is in equi- 
librium with the disease locua, equation (1) for P^ is identical to the expression for P, when a marker 
is fully informative as was assumed by B'lsch & Merikangas (1996). 

PropoHion ofA/B par mis in aswrtained families 

Since power analyses often calculate power for a speclEc number of asc^irtained families, the 
proportion of informative A/B parents in such families must be calculated to determine the subset 
of parents to which the TDT or ASP test is applied when a marker ig bi-allelic. Among parents 
ascertained through one affected child, it can be shown that the expected proportion of A/B 

//V-f'* where JT* Is aa previously defined (see equation (8)) and « T s^X" i 
^ i>'a+2?7(l-p)^+(l-je>)2y. Similarly, the expected proportion of A/B parents among those J '^''^ 
aeoertained through an ASP can be shown tK5 be II/F where // is as defined in Appendix I and 



32^. »3 



With appropriate substitutions (a = /c^^,^ = Ajy), the expressions H^/F"^ s^nd H/F reduce to tlie 
corresponding expressions given by Risch & Merikangaa (1996) for the proportion of heterozygous 
parents found in families having at least one and two affected children, respectively. Furthennore, 
for the recessive, zero-phenocopy disease considered by Ott (1989), the sum of the two probabilities 
for heterozygous parents in Otfs table TI gives the proportion of heterozygous parents in families 
Qficertained through a single affected child. When appropriate substitutions are made (a > 0,/? - 
y = 0), the expre.8sion ^/iP* also reduces to the proportion of heterozygous parents predicted by 
Ott\. table. It is also important to note that Sham & Curtis (1995) derived a table of probabilities 
aiia ogous to Ott'R table 11, except that their tahJe 3 ha^ entries for a variable number of marker 
alleles and their probabilities describe a genera] model of disease. If t^ble 3 of Sham & Curtis is 
assunved to have only two marker alleles, then the two probabHities for heterozygous parents pre- 
dict a probability of allele torai^amission identical to Pt (equation (3)) as well as a proportion of 
beterozygoua parents in ascertained families which is Identical to E^/F"^, 
Poimr of Xtai for a mttlH-alkHc marker 

^ h^plotype frequencies (c„c„03,c,) have represented a bl-allelic marker linked to 

Lt ^l r uT' ^'^V^T/''^^'"'^"" ^^^^ ^'^^ correspond to any two marker alleles 
(«i,a,) of a multi-allehc marker linked to a bi-aUelie disease locus [c, «=/(o,D),c, -/(a,rf) c,^ 
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E, McGinnis ^ 

f(aT»r - JXa d)l Th« expres8mn for H^F* (or H/F) would then be the proportion of '^'^f^''^^ 
birozygo.. for a, and a, and the exp^^sion for (orT.) would he the 
LXonafprobability that a./., parent. tra...nit allele a. to ^^^^f^^^^^ 

the expression, for Pf (or PJ iu,d H-fF^ (or ^/i') couJd be used to investigo.te the power of any 
•^trateffV for applying the TBT to a mttlti-allelic raarker. ...... 

ulSlZ^^^ a strategy ..comn^ended by Spielman & Ewen. (1996) in which xf.. - 
calculated for each allele i of a multi-allelic marker (i = X in k) by evalua.|ng P^^^^^^^^^ 
for allele i and the other alleles grouped together (non-<). The marker ,3 then tcBted for Unka^e by 
evaluating the «tati3tica.l significance of the largest of the k xtit^ ^sing significance outpomtB 
adhjfited for multiple testing Mid n on -independence of the chi-squares (see Ewen^ & Spielman 
f 1097] for a table of these cutpoint8>. The power of this procedui^ can be estimated for any raultv 
allelic marker raodel aa follows: For each i/nonW determine the haplotype frequencies Ci.^,c,,c, 
and calculate the associated values, of Pf and ff^/F* (or P„ H/F aj^dPj. Then determine the ./non- 
i likely to give the highest by calculating the expected value of each xIax [^(Afiav)]- (For S 
pa^ieni^ of an ASP, it can be shown that Eixl..) = 2(£f-l)(2P,-l)*+2n while for singletons, 
^^^j^^) ^ + For the i/nonW giving the highest E{xl,), TOn power would then 

be det^.rmined exactly as for a bi-allelic marker (see Power of xla ^r^d ^Lp) except that an adjusted 
significance cutpoint would be used aa described above. 

To briefly examine power for a particular multi-allelic example, consider the bi-allelic marker 
and disease locus m the bottom line of Table 2 (r = 4). The frequencies {p and m) of disease allele 

D iM<l i><>oi*ivBly «9*<ioiAtod m<^nlrop allolo JL oro O 1.5 nr^A ft 9*i, rpsjw^f^.iv^ly. and TT)T pnwf.r ia 0.99 

when d => and 0.86 when d = Suppose p and m remain constant as does the degree of 

positive association between alleles ^ and D (i5 « S^^ or \8^) but suppose the marker consists of 
k- 1 additional (non-A) alleles having negative or no association with disease allele 0. Then A/non- 
A would give the highest E(xlii) of any i/non-i and thus TDT power would be determined by the 
and n/F ehown in the bottom line of the table. According to Ewen* & Spielman (1097), adjusted 
outpoints (0.05 aignifioAnee) for Jb « 2, jfe « 4 and = S are xt^i = '^-S*. ^'^^ ''•^1* respectively; 
thus TDT power when fc = 2, 4 or 8 would be 0.86, 0.71 or 0,6.1. at /i ^ and would be 0.99 for 
ik ^ 8 when 8 = This example suggests that TDT power for a multi-allelic marker remains 
relatively strong if (a) one marker allele is strongly associated with either allele of a bi-allelio disease 
locus and (b) the two associated sJleles have similar population frequencies. 

CoTidvding reTnurkji 

Strength of evidence for linkage provided by xLn Tdat critically depends upon the magnitude 
of departure from the null hypothesis value of 0.5» the size of departiu?e being quantified by 
(Pa -0.5) and tPt-0.6| for the ASP and TDT paradigms, respectively. In thw paper, I have shown 
that (Ps-0.5) and lPt-0.5| are each a product of three corresponding factors [(P,— 0.5) = 
LgA/ft-RsJPt— 0.5| =sXt|MJi2t]. and it depend only on the recombination fraction (6), JZg and 
depend only on disease penetrance (a, fi, y) and the frequency {p) of the disease allele and, fur- 
thermore, marker allele frequency (m) and disequilibrium {$) influence only and |AfJ. Hence, 
the corresponding factors in (P,— 0.5) and IP^— 0.5| fa.cDit^.te comparisons between the ASP and 
TDT paradigms, and also enable some 'partitioning' of the contribution to evidence for linkage 
provided by standard genetic variables such as 0, 5, m, etc. 

Together with the expression for parental heterozygosity at the marker IH/F)^ the expressions 
for 7^ and P^ provide a general framework for calculating and comparing the power of ;^5«p and xf^^. 
This framework generalizes the ASP-TDT comparison of Risch & Merikangas (1996) by cncom 
passing rasony modes of inheritance rather than jisst one, and. also by enabling TDT power 
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Comparison of ths TDT and ASP test 171 ^ll^r^ I'sQi 

I calculated for a msrker that isi distinct from the disease locca. Analysis of tho equations shows J L- 
l that TDT power is groatly increased if disequilibrium is strong and if the disease allele and pOBi- 
tively a<jsoci8.ted marker allele have aiinilar population frequenojea. The equations al^io show that 
the superior power of the TDT comp&ted to the ASP teat Isi greatest when Hueceptibility looi confer 
TTiodest diaeaae risk, aa indiv^ted by low vahies of the penetrance ratio r. When a marker is strongly 
associated with a disease locus that oontribut^a modest disease risk, [P^— 0.6| > ^ 0. Thus, 

the TDT ia likely to play an important role in detecting and replicating linkages to lod responsible 
for complex gonetic disease. 

T am deeply grcitcf-ul tu Ricliard S^i&Iftian fur cn<;t>urAgCffncnt and valuable auggestione as this work dnvQlopod. T 
am alau indebted to Wamn £!wen« for valuable cummenta and for crJticJsim that improved t.hc mAJiiincripti. Thin 
research woa supported by NIH grants DK466t8 and DK47481 and by grant 193189 from the Jiivi^Tiilo DUhntw 
Foundation. 
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APPRNPIX I 

Derivation of eirpre$8io7ut for P;, 7>, H 

The dovfvationa oseume the general modol of a bi-aUelic roorker ajid linked bi-allelio dieeaae locus 
that ia the only loous that underlies disease susceptibility (gee General algebraic model of linltage 
in the vDsixi text). I begin the derivation of and (equationa (!) and (2)) by first deriving 
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mating type in Table this totw glmplificf? to the overall * weighted' subpopul&tjon frequency 
shown in column 2, For example, when k = 2 (and hence /„ = f{r—l)/(iV^—l)), the sum for the 
mating type in line 1 simplifica as follows: 



r 








2 


r- 






2 








2 


2 


a+fi 


1 



]Ba.sed ox) the 'weighted' suhpopulation frequencies in column 2, what then is the prohability of 
randomly ^electing a particular *AfB father-mating type* frona among famjiie? with an AfB father 
and i\r children, at least two of whojn are alFeuted ? Setting ^ = 2, the probability of randomly 
pelpoting a particular mating type would be the weighted frequency of the mating type divided by 
the sum of all the weighted frequencies in column 2. Thus, the probability of randomly selecting 
the mating type on line 1 (AD/BdxD/D) would he: 

whei^ //= 2(c,c,-hc,C3) ^ \W-P) ?LtM±l i±l 

+ Catp V + fe»(l-p)(a-f-^«-h(l-p)=^} 
+ 2r:, C^lp^fi'-hipi^ -p) (^4-7)» -h {.I 

Note that the quantity ^ cancels in the numerator and denominator of thus demonstrating 
that the probability is independent of family size (N) and hence applies to mixed populations of 
families of any size that have 2 or more aifeotied oflfepring. Furthermore, if mating types with A/B 
mothera were included in Table A2, the only effect wouJd be to double each frequency in coluron 2; 
but would be unchanged since the •2's' in the doubled numerator and denominator of .Ppe would 
cancel. Thus, by setting k^2, the/»^ calculated for each mating type in Table A2 applies to random 
selection of ^1/^ parevM of two or more affected offspring jfrom families of any size. 

What, then, is the probability that a randomly selected A/B parent of a particular mating type 
transmitted allele A (allele B) to en affected child? For eaeh mating type in Table A2» the two 
rightmost columns show the conditional probability that the A/B parent transmitted aJlele A Of B 
fo an individual affected offspring. These probabilities follow directly from the conditional proba- 
bilitiea in Table A L For example, in the AD/Bd x D/D mating in line i of Table A2, the A/3 parent 
ha* allele^ in coupling with allele and B in coupling with d, end thus Table Al implies that A 
is transmitted to affected offspring with a probability of 
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